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FIELD OF THE INVENTION 

[0001] This invention relates to providing a connection 

tool to extract information from an electronic document 
source, for instance an XML source, such as an electronic 
marketplace, and convert it to a compatible format before 
providing it to an analytical application, such as a data 
warehouse system. 

BACKGROUND OF THE INVENTION 

[0002] Many different electronic document sources exist 

these days that exchange or facilitate the exchange of 
electronic documents. One such source is a business-to- 
business electronic marketplace. Such an electronic 
marketplace is typically accessible to businesses through 
computer networks, such as the Internet. Electronic 
marketplaces permit their members to purchase and sell 
products to other members of the marketplace and negotiate 
such transactions without needing to actually talk to the 
other members . Electronic marketplaces are computer 
systems which are generally maintained by: 1) a company for 
internal use amongst business units; 2) a group of 
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companies that are likely to trade amongst themselves; or 
3) a third party hoping to attract members to the 
marketplace. These systems greatly increase the ease of 
which members can purchase and sell goods. This leads to a 
decrease in transaction costs for those involved. 
[0003] Additionally, analytical applications such as 

data warehouses exist. Data warehouses are designed to 
gather data from various sources and perform very fast 
analyses of the data. Because some users or operators of 
electronic marketplaces may desire such analyses of 
transactions occurring on an electronic marketplace or of 
information present on the electronic marketplace, a 
connection tool to extract information from an electronic 
document source, such as an electronic marketplace and 
convert the format to one used by an analytical 
application, such as a data warehouse prior to forwarding 
it onto the application is needed. 



SUMMARY OF THE INVENTION 

[0004] An embodiment of the present invention provides a 

connection tool for extracting information from an 
electronic marketplace and converting the format of that 
information to that used by a data warehouse. 
[0005] Another embodiment of the present invention 



provides an electronic marketplace having a connection tool 
for extracting information from an electronic marketplace 
and converting the format of information to that used by a 
data warehouse. 

[0006] As such, it is an object of the present invention 

to permit the extraction of information on an electronic 
marketplace and convert the format of that information into 
a format utilized by a data warehouse. 

[0007] It is a further object of the present invention 

to permit an electronic marketplace to extract certain 
information therein and convert the format of that 
information into a format utilized by a data warehouse. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0008] Figure 1 is a block diagram of an connection tool 

according to an embodiment of the present invention. 
[0009] Figure 2 is a block diagram of an electronic 

marketplace having a connection tool according to an 
embodiment of the present invention. 

[0010] Figure 3 is a block diagram of an electronic 

marketplace having a connection tool according to another 
embodiment of the present invention. 
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[0011] Figure 4 is a block diagram of an electronic 

marketplace having a connection tool according to yet 
another embodiment of the present invention. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
[0012] The present invention will be better understood 

by reference to the accompanying drawings. 

[0013] A connection tool according to an embodiment of 

the present invention is shown in Figure 1. Connection 
tool 10 is fed data, such as XML documents, by application 
20 which may reside on an electronic marketplace or at 
another location and by electronic marketplace router 30. 
Preferably, a broker interface 35 is provided on each 
application 20 and on the marketplace router 30 for 
transporting the XML documents to the connection tool 10. 
Broker interface 35 permits marketplace integrators to call 
the connection tool regardless of how it is deployed within 
the electronic marketplace. 

[0014] Broker interface 35 can be implemented through 

the creation of a message format. This format can be based 
on MIME and specify header information and a message body 
consisting of an XML document. Broker interface 35 

supports the exchange of messages between server 4 0 and 
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application 20 or marketplace router 30 using a variety of 
transport protocols such as HTTP, HTTPS, RMI, SOAP or JMS 
and provide for guaranteed delivery of the message. Broker 
interface 35 consists of help classes and JavaMail. If 
replies are desired from the connection tool 10, a reply 
message can be handled from application 20 and marketplace 
router 30. 

[0015] The documents sent by application 2 0 or 

marketplace router 30 are received by server 40. Server 40 
manages the forwarding of incoming documents to interested 
listeners 55, 60, 65, 70, and 75 through a document 
subscription model. All listeners subscribe to a set of 
documents identified by header attributes, such as document 
source, document type and document version. Preferably, 
each listener can subscribe to multiple sets of message 
header attributes. Once a listener receives a document, it 
is responsible for the extraction of information from the 
document. Thus, information from documents is filtered 
based on header attributes, for example document type. 
[0016] Listener interface 50 provides an interface 

between server 40 and the connection tool 10 outputs. 
Listener interface 50 can be implemented through the 
creating of a Java class. During initialization of 
connection tool 10, the connection tool 10 looks up an 
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XML property file and instantiates all listeners listed in 
the XML property file. The listener objects receive the 
Server instance and also a Thread Pool object for 
performance reasons . 

[0017] An inner ThreadJob class can be created to run 

the time consuming extraction and business processes. The 
Thread Pool will execute this object to improve 
performance. The Thread Pool can be used to run the XSL 
extraction process. 

[0018] Each listener 55, 60, 65, 70 and 75 (discussed 

below) must include a listener interface 50 in order to 
communicate with server 40. Server 40 keeps track of all 
document attributes that listeners are interested in and 
knows to which listener each document will be forwarded. 
Server 40 will forward a copy of the message including the 
document to each listener interested in that particular 
document . The listener will in turn extract information 
from the XML document using XSLT. 

[0019] Deployment of these listeners 55, 60, 65, 70 and 

75 can be done via a XML property file. Essentially, this 
XML file provides a means for an administrator to deploy 
different listeners or add new document attributes to which 
the listeners are subscribing. 

[0020] Connection tool 10 should preferably provide a 
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standard document listener interface implementation so that 
a marketplace integrator will only need to write XSL to 
extract data from an XML document message and not have to 
deal with the Java implementation of document listeners 
themselves. Through this implementation, the marketplace 
operator can write and test their listener via tools like 
XMLSpy on their desktop. They can deploy the XSL files 
later by configuring the connection tool XML property file. 
[0021] Different extraction XSLT schemas can be devised 

to form the listeners that extract and convert the 
information. Purchase order listener 55, purchase order 
master data listener 60, sales order listener 65, sales 
order acknowledgement 70 and invoice listener 75 shown in 
Figure 1 are examples of such schemas. Each of these 
schemas can convert information into the format utilized by 
a data warehouse but extract information from different 
document types. Purchase order listener 55 extracts 

information from purchase order documents. Purchase order 
master data listener 60 extracts master data information 
from purchase order documents. Sales order listener 65 
extracts information from sales order documents. Sales 
order acknowledgement listener 7 0 extracts information from 
sales order acknowledgement documents. Invoice listener 75 
extracts information from invoice documents. Because each 
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listener can essentially listen to multiple document 
attributes, such as multiple document types, the listeners 
mentioned above could alternatively be deployed into a 
single listener if the end point (80, 85, 90, 95) for each 
is the same. This will reduce the number of listeners 
deployed in connection tool 10. 

[0022] If the information desired to be analyzed by the 

data warehouse relates to purchase orders placed and 
acknowledged, purchase order listener 55 and purchase order 
master data listener 60 can forward information to data 
warehouse 80 as shown in Figure 1. Sales order listener 
can output to database 85, for example. Sales order 
acknowledge 70 can be output to a third party service 90, 
for example. Invoice listener 75 can be output to an order 
management service within the electronic marketplace, for 
instance . 

[0023] The connection tool 10 can provide standard 

listeners which transform the XML document into a schema 
used by a data warehouse, such as BWXML schema for Business 
Warehouse by SAP, A.G., and to forward that converted 
document to the business warehouse or to an existing 
electronic marketplace infrastructure. 

[0024] BWXML schema is used as a "data format black box" 

to load data to the data warehouse. BWXML permits the 
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utilization of the power of XML to describe the metadata of 
data (e.g., to describe the field type, length, data 
object, etc.). With this metadata information, an XML 
receive function module on the data warehouse should be 
able to create data source automatically from BWXML if it 
is not already in existence in the data warehouse. 
[0025] The BWXML schema can be created through the use 

of a <docheader> subtree, a <structure> subtree and a 
<table> subtree. 

[0026] The <docheader> subtree structure is used to 

describe the XML document original. This subtree consists 
of <datasource>, <docsource>, <doctype> and <docver>. 
<datasource> is used to indicate the data warehouse data 
source to which the information is destined. <docsource> 
is used to describe the XML schema source or infrastructure 
source. <doctype> is used to indicate the document type 
(e.g., purchase order, sales order, opportunity, etc.). 
<docver> is used to indicate the document type version 
number. 

[0027] The <structure> subtree is the metadata for the 

data of the <table> subtree. The <structure> subtree 
describes how the data structure should look. In theory, 
this subtree structure allows the data warehouse's XML 
receiver function module to create a datasource 
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automatically. 

[0028] The <table> subtree will contain the actual 
extracted data from the XML documents (e.g., 
purchaseorder/salesorder) <row></row> will be a single 
record so to speak and the total number of 
<f ieldvalue></f ieldvalue> pairs should be the same as what 
is described in <structure> subtree. 

[0029] A few different embodiments of electronic 
marketplaces having connection tools according to the 
present invention are now presented. Certain of the 
embodiments are better suited for use under certain 
circumstances as will be described below. 

[0030] Figure 2 depicts an electronic marketplace having 

a connection tool according to an embodiment of the present 
invention. Electronic marketplace 100 connects user 110 to 
user 120 to enable business-to-business electronic commerce 
between them. During such a transaction, an XML document 
from user 110 flows into electronic marketplace 100 and to 
master router 130. Master router 130 can route the 
document to any of a number of business services 14 0. 
Connection tool 10 can be connected to master router 130 so 
that it extracts and forwards data to the data warehouse 
150 transparent from the business services 140 and user 
applications existing at users 110 and 120. Data warehouse 
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150 can be located within electronic marketplace 100, as 
pictured, or alternatively, external to electronic 
marketplace 100. Under the embodiment of Figure 2, it is 
easy to manage deployment of XSL listeners within 
connection tool 10 because connection tool 10 exists in 
only one place on electronic marketplace 100. 
[0031] Figure 3 shows another embodiment of an 

electronic marketplace having a connection tool 10. 
Electronic marketplace 200 connects user 110 to user 120 to 
enable business-to-business electronic commerce between 
them. User 110 is outfitted with connection tool 10-3. 
When user 110 is transferring a document to marketplace 130 
in which data warehouse 150 is interested, the document can 
be converted by connection tool 10-3 prior to it being 
transferred to electronic marketplace 200. Thus, the 
document being sent to electronic marketplace 200 is in the 
format used by data warehouse 150 and can be encrypted 
prior to sending and decrypted after data extraction by 
data warehouse 150. Because it is already in the format 
used by data warehouse 150, it can be sent directly to data 
warehouse 150 without the need to convert the format of the 
document through router 130 via lines 112 and 115 . 
Alternatively, a direct line can connect connection tool 
10-3 with data warehouse 150 bypassing router 130. If the 
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document is not one in which data warehouse is interested, 
it can be sent via line 116 without being converted by 
connection tool 10-3. 

[0032] Connection tool 10-2 is provided to convert 

documents being handled by business services 140 in which 
data warehouse 150 is interested. These documents can flow 
through connection tool 10-2 to marketplace router 130 via 
line 136 and on to data warehouse 150 through line 115. 
Alternatively, a direct connection from connection .tool 10- 
2 to data warehouse 150 can be provided. Line 133 connects 
business services 140 to router 130 for documents in which 
data warehouse is not interested. 

[0033] As in the embodiment of Figure 2, connection tool 

10-1 can be connected to master router 130 so that it 
extracts and forwards data to the data warehouse 150 that 
has not yet been converted by another connection tool. 
Also as in the embodiment of Figure 2, data warehouse 150 
can be located internally or externally to electronic 
marketplace 100. Line 138 connects router 130 to user 120. 
[0034] Figure 4 shows yet another embodiment of an 

electronic marketplace having a connection tool. 
Electronic marketplace 300 connects user 110 to user 120 to 
enable business-to-business electronic commerce between 
them. During such a transaction, two copies of an XML 
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document 111-1 and 111-2 are sent from user 110 into 
electronic marketplace 300 and on to master router 130. 
Copy 111-2 is sent with connection tool 10 listed as the 
intended recipient and thus is routed to connection tool 10 
by master router 130. Connection tool 10 extracts, 
converts and forwards data to the data warehouse 150. As 
with the embodiments of Figures 2 and 3, data warehouse 150 
can be located internally or externally to electronic 
marketplace 100. Connection tool 10 is connected to master 
router 130 as an addressee. Under the embodiment of Figure 
4, like the embodiment of Figure 2, only a single 
connection tool is necessary and thus XSL listeners only 
need to be deployed in one location, simplifying 
maintenance. Also the applications at users 110 and 120 do 
not need to be connection tool-aware, but just treat 
connection tool 10 as a business service. 

[0035] Although the preferred embodiments of the present 

invention have been described and illustrated in detail, it 
will be evident to those skilled in the art that various 
modifications and changes may be made thereto without 
departing from the spirit and scope of the invention as set 
forth in the appended claims and equivalents thereof. 
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